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Two simple spin models are studied to show that the microcanonical entropy can be a non-concave 
function of the energy, and that the microcanonical and canonical ensembles can give non-equivalent 
descriptions of the same system in the thermodynamic limit. The two models are simple variations 
of the classical paramagnetic spin model of non-interacting spins and are solved as easily as the 
latter model. 
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I. INTRODUCTION 

The goal of this paper is to present two simple spin 
models with the property that their entropy S(U) is non- 
concave; that is, S(U) does not have only one maximum, 
but has several local maxima (Fig. [T]). It is hoped that 
these models will interest not only students in physics, 
but also physicists who have been taught that the entropy 
is always concave. The fact, as will become clear, is that 
there is nothing in the definition of the entropy that pre- 
vents S(U) from being non-concave. What actually de- 
termines the concavity of the entropy is the nature of the 
interactions and, more precisely, their range. If the com- 
ponents (for example, particles or spins) of a many-body 
system interact through short-range interactions (as in 
the nearest- neighbor Ising model), the entropy of such 
a system is concave in the thermodynamic limit. 1 How- 
ever, if the same components interact through long-range 
interactions (as in mean- field models), then the entropy 
can be non-concave in the thermodynamic limits 

Many systems with long-range interactions are known 
to have non-concave entropies^ Examples include sys- 
tems of gravitating particles used to model stars and 
galaxies^£i£ several spin models^ ' 10 ^ 1 ' 12 ! 13 in addi- 
tion to statistical models of fluid turbulence, 14 which 
have been used to describe the Great Red Spot of 
Jupiter 15 among other interesting phenomena. Most of 
these models are difficult to solve analytically and numer- 
ically, which means that they are not really accessible to 
students with only a basic knowledge of statistical me- 
chanics. 

The two spin models presented here were designed to 
overcome this problem. The two were constructed so as 
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FIG. 1: (a) Typical concave entropy S(U). (b) Example of 
non-concave entropy with two local maxima. 



to provide the simplest proof by example of the fact that 
the entropy can be non-concave for long-range interac- 
tions, and may be presented in an introductory course on 
statistical mechanics after presenting the ideal spin model 
of paramagnetism as both are simple variations of that 
model. The price to pay for this simplicity is that neither 
of the proposed models is physically realistic. Yet they 
do well in capturing several interesting features of more 
realistic models having non-concave entropies, including 
those mentioned above. Thus they should serve as good 
toy models for understanding and explaining much of the 
physics observed in real systems. 

Readers who wish to learn more about long-range sys- 
tems and non-concave entropies are encouraged to read 
the collection of papers edited by Dauxois et al.— Other 
accessible references on these subjects will be mentioned 
at the end of the paper. 



II. TWO-STATE MODEL 

The basic model that serves as a template for the two 
variations to be studied here is the paramagnetic spin 
model consisting of N spin-^ particles s\, S2, ■ ■ ■ , sn in- 
teracting with an external magnetic field H ^ If we as- 
sume that the spins do not interact with each other, we 
can write the total energy of the system in the usual form 

N 

U = fxHj2si, (1) 

i=l 

where /i is the magnetic moment of the spins, and Si is 
the spin variable with the value —1 when the ith spin is 
aligned opposite to the magnetic field and si = +1 when 
the spin is aligned with the field. 

The first variation of this model that we consider is 
constructed by assuming that there is a constraint on 
the spins that has the effect of forcing them to all take 
the same state. That constraint is effectively a long- 
range interaction, because it affects all the spins, and can 
be thought of as arising from an infinite ferromagnetic 
interaction acting between the spins or from an infinite 
energy cost assigned to configurations other than the two 
completely- aligned configurations^ For our purposes we 
do not need to specify the nature of the constraint other 
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FIG. 2: Two-state model, (a) Thermodynamic entropy s(u). (b) Thermodynamic free energy f(/3). (c) Equilibrium energy 
per spin u{0) = f'(/3) in the canonical ensemble. 



than just assuming that it is there and that only two 
possible configurations remain, namely, 



and 



Sl = S 2 



Si = s 2 



SN = — 1, 
S N = +1, 



(2a) 
(2b) 



to which are associated two energy values, U — —N and 
U = N, in units where fj,H = 1. The entropy associated 
with the two values of the energy is 



S(U = -N) = S(U = N) = ln(l) = 



(3) 



in units where fee = 1^ We can also write S(U) — 
ln(0) = — oo for all U ^ ±N because there are no 
spin configurations of energy different than ±N with 
the added constraint. Therefore, in the thermodynamic 
limit, we obtain 



are worth noting. The first is that s(u) is a non-concave 
function of u having two maxima corresponding to its 
two finite values. The second is that /(/?) has a jump in 
its derivative at (3 = 0. Because the derivative of the free 
energy of a thermodynamic system yields the equilibrium 
energy of that system as a function of its temperature, 
we expect that the discontinuity of the derivative of f{(3) 
is related to a phase transition. In this case the phase 
transition is trivial: it arises because the energy per spin 
u is discrete and can take only two values. For nega- 
tive temperatures, the equilibrium energy per spin of the 
model is u{(3) = f'(/3) = 1 with all the spins pointing 
in the up direction. As (3 crosses the transition, (3 = 0, 
the energy per spin switches to u((3) = — 1 (the block of 
spins reverses direction), and remains at this value for 
all positive temperatures because /'(/?) = — 1 for (3 > 0. 
The complete behavior of u{(3) is illustrated in Fig. [DJc). 
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s(u) = lim 







if u = ±1 



— oo otherwise 



(4) 



for the entropy per spin expressed in terms of the energy 
per spin u = U/N. Equation Q is the thermodynamic 
entropy of the spin model with the constraint on the spin 
value. 

The free energy of this model is calculated just as eas- 
ily. The partition function is 



Z{(3) 



e 0N + (} -0N 



(5) 



because only two energy values are allowed, each associ- 
ated with only one spin configuration, so that 



F{fi) = - In Z(P) = -ln(e /3JV + 



(0) 



Note that we have omitted the factor of 1/(3 in Eq. ©. 
The free energy per spin f{(3) is found by by taking the 
thermodynamic limit: 



(7) 



The functions s(u) and /(/?) are plotted in Fig. [Ha) 
and^b), respectively. Two properties of these functions 



III. TWO-BLOCK MODEL 

The entropy of the two-state model of Sec. [IT] is non- 
concave as a function of the energy per spin, but in 
a rather singular and contrived way. Is it possible to 
construct a spin model having an entropy which is non- 
concave and continuous? The answer is yes. 

Consider the block of "frozen" spins si, S2, ■ ■ ■ , sat that 
we considered earlier, which are constrained to take 
the same value. We add to this block N new spins 
<7i, (72, ■ ■ ■ , cat which are free to take the values ±1 inde- 
pendently of one another. That is, the spins Oi are non- 
interacting as in the standard paramagnetic spin model. 
Also the s-spins do not interact with the cr-spins, so that 
the total energy of the composite system can be written 
as 



N 



N 



N 



u = u s + u a = Si + a i = Ns i + ^2^ 



(8) 



i = l 



where Sj, <7j = ±1. (We assume that fiH = 1.) The 
minimum value of the total energy corresponding to 
the ground state of the system is now U = —2N, and 
its maximum value, corresponding to the excited en- 
ergy, is U = 2N. For each of these energies, we have 
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u = U/(2N) = ±1 as the minimum and maximum en- 
ergy per spin, respectively. 

The thermodynamic entropy and free energy of the two 
block model can be calculated exactly. The spins Si and 
Ui have only one ground state corresponding to all the 
spins being in the down direction, so that the entropy is 
s(— 1) = 0. Similarly, 

si = s 2 = ■ ■ ■ = s N = o\ = cr 2 = • • • = a N = +1 (9) 

is the only spin configuration which realizes the excited 
energy per spin u = 1, so that s(l) = 0. In the middle of 
the energy spectrum, there are two configurations with 
u = 0, which are obtained by putting all the spins s, in 
one direction and all the spins at in the opposite direc- 
tion. As a result, we must also have s(0) = 0. Hence 
there are three values of u at which s = 0. Moreover, it 
is easy to see that s(u) ^ when u ^ —1, 0, 1, and that 
s(u) is symmetric about u — 0. Therefore, s(u) should 
be a non-concave function of u having two symmetric 
maxima located between three zeros. 

To confirm this guess, let us calculate s(u). Note that 
the number Q(U) of spin configurations of the composite 
system with energy U — U s + U a can be decomposed as 

n(u)=n (r (u tT = u + N) + n a {u (r = u-N), (10) 

where Q a (U a ) is the number of configurations of spins Ui 
with energy U a . These spins form on their own a system 
of TV free spins, the thermodynamic entropy of which is 



1+itV fl + u 
In 



where u £ [—1, 1]. Therefore, we can write 

Sl tr (U a )Me N "<- u -M, (12) 

with sub-exponential corrections in N. If we use this 
approximation in Eq. (|10|) and evaluate the limit 



s(u)= lim — lnfi(2iVu), 



(13) 



we obtain 



s(u) = - max{s CT (2w - 1), s a (2u + 1)} (14) 



or, equivalently, 



s(u) 




Hue [-l,0[ 
if uE [0,1]. 



(15) 



The plot in Fig. [3Ja) shows that this entropy has the 
anticipated form. The presence of the two maxima ob- 
viously make s(u) non-concave, but unlike our previous 
example, s(u) is now continuous in u. 

To understand the exact form of s(u), note that the 
spins Si do not contribute to the entropy of the whole 
system, as is evident from Eq. (fTU]) . The sole function of 
these spins is to give or absorb two macroscopic amounts 
of energy, given by U s — —N and U s = N, respectively. 
In this sense, the spins can be thought as playing the 
role of a reservoir of energy or an "energy switch." When 
U s = —N, the total energy U must be in the range 
[— 2N, 0], so that s(u) for u G [—1,0] is the entropy of 
the spins <7j with 



Ua 

N 



U + N 
N 



= 2u+l 



(16) 



as the energy per spin of the spins o^. Similarly, when 
U s = N, U £ [0,2N], so that s(u) for u £ [0,1] cor- 
responds to the entropy of the spins <7j, the energy per 
spin of which is u a = 2u — 1. The extra factor 1/2 in 
front of s a is present because the entropy per spin of the 
composite system is calculated for 2N spins, not just N. 

We next calculate f((3) by first evaluating the partition 
function 



Sl±l,<Tl,CT2,...,<T A 



JV 



-0a 



S = ±l \<T=±1 / 

= (eP N + e-P N )(eP + e-0) N . 



(17a) 

(17b) 
(17c) 



By taking the limit N — > oo as in Eq. 0, but with N 
replaced by 2N, we obtain 



ln(2cosh/3) 



(18) 
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This result is plotted in Fig. [3jb). As for the previ- 
ous model, f{(3) is non-differentiable at (3 = because 
the term in which corresponds to the free energy 
of the frozen spins is non-differentiable at this point. 
The phase transition originating from the discontinuity 
of f'(/3) = u(j3) is illustrated in Fig. OHc), and can be 
interpreted as before by noting that the block of spins 
Si changes orientation when (3 goes continuously from 
j3 = 0" to /3 = 

IV. NON-EQUIVALENT ENSEMBLES 

By now there should be no doubt that s(u) can be a 
non-concave function. What does it mean physically for 
s(u) to be non-concave? If we are to apply the common 
interpretation of the temperature as given by 

0=-r^ = s'(u), (19) 

then it seems as if the temperature of a system with a 
non-concave entropy varies non-monotonically as a func- 
tion of the internal energy. This interpretation of Eq. (TT9|) 
is valid if we view (3 as nothing but the derivative of the 
entropy. However, if we follow the definition of the canon- 
ical ensemble and view (3 as a parameter fixed externally 
by a heat reservoir, then there is a problem. In this case, 
the equilibrium value u{(3) of the energy per spin, which 
is normally determined by solving Eq. (|19p . must be a 
multi-valued function of (3 if s(u) is non-concave, which 
is not what we see in Fig. EJc). Indeed, apart from the 
point (3 — 0, u({3) takes only one value for each (3 even 
though s(u) is non-concave. Thus it seems that for a 
non-concave entropy, the determination of the equilib- 
rium energy from f3 = s'{u) or from the derivative of 
f((3) yields two different results. What is wrong? 

What is wrong, in a nutshell, is that the fundamen- 
tal relation (|19[) does not only determine the equilibrium 
value u{(3) in the canonical ensemble when s{u) is non- 
concave; it also determines a metastable value for u as 
well as an unstable value. What is seen by taking the 
derivative of f((3) is only the stable root of Eq. (TT9"]) cor- 
responding to the value of u realized at equilibrium in 
the canonical ensemble. 

This explanation may be difficult for students to fol- 
low in a first course on equilibrium statistical mechan- 
ics (however, see Problem 3). A much more transparent 
and related consequence of the non-concavity of the en- 
tropy which can be explained to them is that the ther- 
modynamic description provided by the microcanonical 
ensemble as a function of the energy ceases to be com- 
pletely compatible with the thermodynamic description 
provided by the canonical ensemble as a function of the 
temperature. In short, the two ensembles cease to be 
equivalent. 

The two-block model can be used to illustrate this 
point. If we go back to the behavior of u(f3) observed 
for this model (Fig. [3Jc)), we see that u((3) never takes 



any value in the range (—5, 5) because of the discontinu- 
ity at (3 = 0. In the microcanonical ensemble, the energy 
per spin u can be "tuned" anywhere in the range [— 1 , 1] , 
and to each u G [—1,1] is associated a well-defined value 
of the entropy s(u) (Fig. [3ja)). The two ensembles must 
therefore be non-equivalent because there are values of u 
that can be realized in the microcanonical ensemble, but 
not in the canonical ensemble. In the present case, the 
equilibrium states of the microcanonical ensemble related 
to u G (—5,5) have no equivalent equilibrium states in 
the canonical ensemble because these values of u do not 
show up at equilibrium in the latter ensemble for any 
values of (3; compare Figs.^a) and [31(c). 

Another approach for demonstrating the non- 
equivalence of the microcanonical and canonical ensem- 
bles is to show that the Legendre transform of f{(3) does 
not yield the complete entropy s(u) when the latter is 
non-concaved Although more mathematical in nature, 
this approach works well for the two-block model because 
the Legendre transform of the two differentiable branches 
of f([3) can be calculated directly for this model. The 
result of this calculation 21 , is that the Legendre trans- 
form of f((3) yields the correct expression of s(u) given 
in Eq. (TT5|) for all u G [—1, 1] except u G (— \, ^), which is 
exactly the range of values of u not covered by u(j3) and 
the region of non-equivalent ensembles discussed before. 



V. OUTLOOK AND SUGGESTED PROBLEMS 

The two-state and two-block models can be used to 
explore many other intriguing properties of systems with 
non-concave entropies. Provided below are six prob- 
lems covering some of these properties. The solutions 
of these problems can be found in an extended version 
of this paper j22 in addition to two short introductions to 
the theory of non-concave entropies and non-equivalent 
ensembles! 23 ! 24 For more information about the develop- 
ment of this young theory, the reader is invited to read 
the author's thesis^ and the papers of Thirring,— Hertel— 
and Lyndon-Bell. 7 

Problem 1. Show that the Legendre transform of s(u) 
for either the two-state or the two-block model yields 
the correct f((3) for all (3. There is a difficulty here: 
because s(u) is non-concave for both models, how should 
the Legendre transform be defined? The answer is 

f(f3) = mm{l3u - s(u)}, (20) 

u 

where "min" stands for "minimum of." This transform 
is called a Legendre-Fenchel transform. 

Problem 2. Prove in general that if s(u) is non-concave, 
then /(/3) must have one or more non-differentiable 
points. In other words, prove that a non-concave entropy 
in the microcanonical ensemble implies a discontinuous 
or first-order phase transition in the canonical ensemble. 

Problem 3. Show that the distribution of the energy 
per particle u = U/N in the canonical ensemble, given 
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by 



-PNu 



-N[f3u-s(u)-f(f3)] 



HP) 



(21) 



has more than one maximum when s(u) is non-concave. 
Show also that the positions of the various maxima of 
Ppiu) are given for N — * oo by Eq. (fT!?)) . Which of 
these maxima corresponds to the equilibrium value of ul 
Which is the metastable value? Does Pp{u) have more 
than one maximum when s(u) is concave? 

Problem 4- Show that a system with a non-concave 
entropy can have values of u at which its heat capacity, 
calculated in the microcanonical ensemble for fixed values 
of u, is negative. Can the heat capacity be negative in 
the canonical ensemble? 

Problem 5. Show that you can recover the full non- 
concave entropy of the two-state or two-block model by 
calculating the Legendre transform of the Gaussian free 
energy j 1 (/?) associated with the following Gaussian par- 
tition function: 



-0U-jU 2 



(22) 



How should 7 be chosen? [The calculation of Z^((3) for 
a model similar to the two-state model can be found 
in Ref. [26]. The theory of Gaussian free energies and 
their relation to non-concave entropies can be found in 
Ref. 

Problem 6. Given that the microcanonical and canon- 
ical ensembles are non-equivalent for systems with non- 
concave entropies, under which conditions or principles 
should we use one ensemble or the other to describe such 
systems? Is one of the two ensembles more "physical" or 
more "natural" than the other? 
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